AITopics | van roy

517da335fd0ec2f4a25ea139d5494163-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 21:42:15 GMT

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)

Add feedback

Finite-Time Logarithmic Bayes Regret Upper Bounds

Neural Information Processing SystemsApr-24-2026, 19:33:57 GMT

We derive the first finite-time logarithmic Bayes regret upper bounds for Bayesian bandits. In a multi-armed bandit, we obtain O(c logn)and O(ch log2 n)upper bounds for an upper confidence bound algorithm, where ch and c are constants depending on the prior distribution and the gaps of bandit instances sampled from it, respectively. The latter bound asymptotically matches the lower bound of Lai (1987). Our proofs are a major technical departure from prior works, while being simple and general. To show the generality of our techniques, we apply them to linear bandits. Our results provide insights on the value of prior in the Bayesian setting, both in the objective and as a side information given to the learner. They significantly improve upon existing O( n)bounds, which have become standard in the literature despite the logarithmic lower bound of Lai (1987).

bandit, data mining, machine learning, (22 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.14)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

5141f6bc105d30edbae48f1d2e0b1e66-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 03:16:02 GMT

agent, joint prediction, prediction, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)

Add feedback

Scalar Posterior Sampling with Applications

Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Nikos Vlassis

Neural Information Processing SystemsFeb-14-2026, 10:31:18 GMT

Peter L learning UAI, pages Dimitri Dynamic, Belmont, Ronen I optimal Journal Aditya processes.

artificial intelligence, machine learning, ouyangetal, (12 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Add feedback

AgnosticQ-learningwithFunctionApproximationin DeterministicSystems: Near-OptimalBoundson ApproximationErrorandSampleComplexity

Neural Information Processing SystemsFeb-11-2026, 06:16:25 GMT

Therefore, we help address the open problem on agnosticQ-learning proposed in [Wen and Van Roy,2013].

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)

Add feedback

e769e03a9d329b2e864b4bf4ff54ff39-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-10-2026, 21:56:27 GMT

gurobi, imitation learning, solver, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.31)

Add feedback

2a568a9a84577769d838793433c817d9-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 01:13:49 GMT

algorithm, ensemble, linear ensemble, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

517da335fd0ec2f4a25ea139d5494163-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 16:06:55 GMT

Itisoften the responsibility of the agent designer toconstruct thistargetwhich,inrichandcomplexenvironments,constitutesaonerousburden; without full knowledge of the environment itself, a designer may forge a suboptimal learning target that poorly balances the amount ofinformation an agent must acquire to identify the target against the target's associated performance shortfall.

agent, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Add feedback